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Abstract 

A celebrated theorem of Friedgut says that every function / : {0, 1}™ — > {0, 1} can be 
approximated by a function g : {0,1}" — * {0,1} with ||/ — g\\\ < e which depends only 
on e 0( - I fl t ^ variables where // is the sum of the influences of the variables of /. Dinur 
and Friedgut later showed that this statement also holds if we replace the discrete domain 
{0, 1}" with the continuous domain [0, 1]™, under the extra assumption that / is increasing. 
They conjectured that the condition of monotonicity is unnecessary and can be removed. 

We show that certain constant-depth decision trees provide counter-examples to Dinur- 
Friedgut conjecture. This suggests a reformulation of the conjecture in which the function 
g : [0, 1]" — > {0, 1} instead of depending on a small number of variables has a decision 
tree of small depth. In fact we prove this reformulation by showing that the depth of the 
decision tree of g can be bounded by e°^ If ^ >. 

Furthermore we consider a second notion of the influence of a variable, and study the 
functions that have bounded total influence in this sense. We use a theorem of Bourgain 
to show that these functions have certain properties. We also study the relation between 
the two different notions of influence. 
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1 Introduction 



The notion of the influence of a variable on a Boolean function over a product probability space 
plays an important role in computer science, combinatorics, statistical physics, economics and 
game theory (see for example [KKL881 lBKK+921 iFnUl IBou021 lFn98l ILMN93] ). It is usually 
the case that the functions whose variables satisfy certain bounds on their influences have 
simple structures. Although results of this type are studied extensively (see for example [Bou02 , 
Fri98, Hat06] IFKN02] ). still many basic questions are open (A handful of them can be found 
in [Fri04] ). 

For a set X and given a function / : X n — * R, roughly speaking the influence of the i-th 
variable measures the sensitivity of / with respect to the changes in the i-ih coordinate. There 
are numerous ways to measure this sensitivity and each gives rise to a different notion of the 
influence. It is usually the case that certain bounds on the influences imply some structure on 
/. For example there is a theorem of Friedgut (Theorem lAl below) in this flavor that concerns 
the case X = {0, 1}. This theorem has many applications and provides a tool for proving 
hardness of approximation results using PCP reductions, and also serves as a tool in machine 
learning [DS05, KR03, DRS05, OS06J. Dinur and Friedgut later conjectured that this result 
holds in the continuous setting of X = [0, 1] as well. In this article we give a counter-example 
to their conjecture. Fortunately this does not mean that there is no continuous version of 
Friedgut's theorem. Indeed we characterize all functions that satisfy the conditions of Dinur- 
Friedgut conjecture, and prove that such functions enjoy very nice and simple structures. 

A graph on n vertices can be thought of as an element in {0, l}^ 2 ), where each coordinate 
corresponds to one of the (J) possible edges. A graph property on n vertices is a subset of 
graphs on n vertices which is invariant under the permutation of vertices. Hence such a graph 
property can be formulated as a function / : {0, l}^) — » {0, 1} which is invariant under certain 
permutations of coordinates: the ones that come from the permutations of the vertices of a 
graph. Let fJ- p (f) denote the probability that f(G(n,p)) = 1, where G(n,p) is the random 
graph on n vertices where each edge is present with probability p. It follows from a theorem 
of Margulis [Mar 74] and Russo [Rus81] that is closely related to the influences of variables 

for /. The derivative plays an important role in theory of random graphs and statistical 
physics. A bounded derivative shows that the property changes smoothly with respect to the 
changes in p. Friedgut [Fri99] used the connection to the influences to prove a beautiful theorem 
that characterizes all graph properties that satisfy < C for a constant C . He conjectured 
that being a graph property is irrelevant here and his characterization holds for every function 
/ : {0, l} n — > {0,1}. Bourgain [Fri99] proved a weaker version of Friedgut's conjecture. We 
will observe that Bourgain's proof works in the more general setting of functions / : X n — > R. 
Inspired by this we give a conjecture that is similar to Friedgut's conjecture but is for the 
general setting of X n . We show that a positive answer to this conjecture would settle one of 
the conjectures in [Fri04]. 

2 Definitions and known facts 
2.1 Notation 

Throughout this paper, following Hardy notation, C always denotes a universal constant, but 
not necessarily the same in different statements. Let [n] = {1, . . . , n} for every natural number 
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n. For two functions /, g : R + — ► R + , we write / = O(g) if there exists a constant C such that 
/(x) < Cg(x), for every x > C. We also write / = o(g) if hrn^oo f(x)/g(x) = 0. Sometimes 
we write / = O a (g) or / = o a (g i ) to specify that the involved constants may depend on a. 
When there is no subscript, the constants are universal. A function is called Boolean if its 
range is {0, 1}. 

Let (X, J 7 , n) be a probability space, and let X n be the product space endowed with the 
product probability measure /U ra . Our goal is to study measurable functions / : X n — > R. 
Sometimes we consider / as a random variable and write E[/] (the expected value of /) for 
/ f{x)dfi n (x). The variance of / is defined as Var[/] = E[/ 2 ] - (E[/]) 2 . For J C [n], and 
a G X, we denote by (J = a) the event {x : Xj = aVi G J}. Thus we may write E[/| J = a] 
to denote the conditional expectation of / on the event that all variables with indices in J are 
set to be equal to a. 

Suppose that X is an ordered set. Then a function / : X n — ► R is called increasing if 
Xi < yi for all i G [n], implies /(x) < f(y). 

2.2 Fourier- Walsh expansion 

Let (X, J 7 , /x) be a probability space, and let X n be the product space endowed with the 
product probability measure /J, n . Consider a function / : X n — > R. Then it is possible to find 
functions F 5 : X n R for 5 C [n] such that 

(i) We have / = E S c[n] 

(ii) For x = (xi, . . . , x n ) G X", the value of Fs{x) depends only on the variables {xj : z G S 1 }. 

(iii) We have J Fsd[i(xi) = for every i G S 1 . 

(iv) We have / F s F T d^{xi) = for every i€T\S. 

Then / = J2sc[n] is called the Fourier- Walsh expansion of /. Note that (iv) is implied by 
(iii) and from (i-iv) it is easy to see that 

ll/lll = / fd» n = E E / WTdvr = E / = E \\Fs\\l (1) 

SC[n]TC{n] S<Z[n] J SC[n] 

For uniform measure on X = { — 1, 1}, the Fourier- Walsh expansion becomes very simple. 
For S C [n], let : X n —>■ R be defined as ws : (xi, . . . , x n ) i-> riies x *- Then it is easy to 
see that for / : X n — > R, 

/ = E /(<S>S> 

5C[n] 

where 

7(5) = | /(x) Ws (x)d/i n (x). 

So here in the Fourier- Walsh expansion / = J2sc[n] we nave ^5 = f(S) w S, where f(S) is 
just a real number. 
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2.3 Influences 



Let (X, n) be a probability space, and let X n be the product space endowed with the product 
probability measure /i" . For any x = (xi, . . . , x n ) 6 X n , define 

Sj (x) ={y <=X n : yi = Xi Vi/j}, (2) 

or in other words Sj(x) is the set of elements that can be obtained from x by changing only 
the j-th. coordinate. For / : X n — * M, we want to define the notion of the influence of the 
j-th variable as a measurement for the dependence of the value of / to the value of the j-th. 
coordinate. Suppose that / is constant on Sj(x) for some x. Then for this particular x, the 
value of Xj is irrelevant to the value of /. Thus it is natural to define the influence of Xj as the 
probability that / is not constant on Sj(x). This gives rise to our first notion of the influence. 
Next consider the function / : [0, l] 2 — ► M defined as f{xx,X2) = 10xi + X2- With the above 
definition both x\ and X2 have influences 1. But note that changing the value of x\ changes the 
value of / more drastically than changing the value of X2 ■ The next definition of the influence 
captures this difference and is defined as the following. For a particular x, Sj(x) is endowed 
with the probability measure fj,, and Var s ^[/j measures the variation of / on Sj(x). Thus 
we can define the influence of Xj as E[Var s ^[/J]. Let us summarize the discussion in the 
following definition. 

Definition 2.1 Let f : X n — > R where X n is a product probability space with the product 
probability measure /j, n . 

• The influence of the i-th variable on f is defined as 

If(i) = Pr({x : / is not constant on Sj(x)}). (3) 

• The variational influence of the i-th variable on f is defined as 

I f (i) = E[Vav Si{x) [f}]. (4) 

The total influence and total variational influences on f are defined respectively as 

if = £r=i J /W and h = TJU hi 1 )- 

Remark 2.2 Note that ([3]) and (|4|) imply some measur ability conditions on / which we as- 
sume to hold throughout this article. Since we are dealing with probability spaces, for our 
purpose, without loss of generality we might assume that (X, /i) is the Lebesgue measure on 
[0,1]. Throughout the article when a discrete set X is considered as a measure space without 
mentioning the measure, the corresponding measure is assumed to be the uniform probability 
measure. ■ 

The influences are of particular interest when / is the characteristic function of a set, or in 
other words the range of / is {0, 1}. Note that for a function / : X n — * {0, 1}, we always have 
4//(j) < If(j). Indeed let 

n _j 1 /is not constant on Sj(x) 
3 \ / is a constant on Sj(x) 
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Let x be a random variable that takes values in X n according to p n . Note that a,j(x) 
implies ~Var s .^[f] = 0, and in general since / is Boolean, Var a ( z ) [/] < \. Thus 



If(3) 



E[o ; (:r) 



>4E[Var^. (x) [/]]=4I / (j). 



(5) 



In the discrete case of the uniform measure on {0, 1}™, the equality holds in ([5]), and the 
two definitions differ only in a constant factor of 4. But in the context of general Boolean 
functions, If (J) can be much larger than If (J). For example define / : [0, l] n — > {0,1} as 
f(x) = 1 if Xj > e, and f(x) = otherwise. Then If(j) = 1 while If(j) = e(l — e). 

The well-known result of [KKL88] which is known as the KKL inequality says that for every 
function / : {0, 1}" — > {0, 1}, there exists a variable whose influence is at least Cp(l — p)^ 2 -^ 
where p = E[/] = Pr x [/(x) = 1]. Later Friedgut [Fri04j noticed that the approach of KKL88J 
leads to an interesting result that later found many applications [DS05, KR03, DRS05, OS06J: 

Theorem A. For every f : {0, 1}™ — ► {0, 1}, there exists an approximation g : {0, l} n — > {0, 1} 
with ||/ — #11 ! < e which depends only on e CIf ^ e variables. 



In [BKK+92] . Bour gain et al. generalized the framework of KKL to the continuous domain, 
/ : [0, l] n — > {0,1}, and proved that the KKL inequality holds in this setting as well. The 
inequality in this case is usually called the BKKKL inequality. 

Friedgut and Kalai |FK96j noticed that the proof of [BKK+92 can be modified to imply 
the following statement which is one of our main tools in this article. 

Theorem B. For every f : [0, l] n — > {0, 1} with E[/] = p, there always exists i G [n] such that 

e -CI f /p(l-p) < j f ^y ( g) 

The main tool in the proof of many results about the influences including Theorem [X] 
and Theorem [B] is the Bonami-Beckner inequality. This inequality was originally proved by 
Bonami [Bon70] and then independently by Beckner [Bee 75] . It was first used to analyze 
discrete problems in the proof of the KKL inequality [KKL88j . We will use the inequality in 
the following form. 



Theorem C. [Bonami-Beckner Inequality] Let f : { — l,l} n - 
Walsh expansion f := J2sc\n] f(&)- Then for p > 2, and 5 



be a function with the Fourier- 



< 



and for 1 < p < 2, 



> 



^2 6Wf(S)w s 

SC\n] 



52 o~ lSl f(S)w s 

SQn] 



5 



2.4 Decision trees 

Consider a product probability space X n with the product probability measure fj, n . A decision 
tree T is a tree where every internal node v is labeled with an index i v € [n] such that if u 
is the ancestor of v, then i u ^ i v . Children of v are in a one-to-one correspondence with the 
elements of X. Moreover a value val(u) £ R is assigned to every leaf v of T. The depth of the 
decision tree is the maximum number of edges on a path from the root to a leaf. We call the 
immediate parents of the leaves, the leaf-parents. A decision tree is called complete if all leaves 
are in the same distance from the root. 

Every decision tree T corresponds to a function fx '■ X n — ► R: consider x G X n \ we start 
from the root and explore a path to one of the leaves in the following way: Every time that 
we meet an internal node v, we look at the value x^ v and move to the corresponding child of 
v. We continue until we reach a leaf u. Then f(x) = val(it). The traversed path from the root 
to the leaf is called the computing path of x. 

There are interesting relations between the structure of a decision tree T, and the influences 
of the variables on fx (see for example [OSS05]). The following simple lemma shows that the 
total influence on fx is bounded by the depth of T. 

Lemma 2.3 Consider a decision tree T of depth k and the function fx- Then It T < k. 

Proof. Choose x £ X n at random according to fi n , and % G [n] uniformly at random. If i does 
not appear on the computing path of x, then fx is constant on Si(x). The probability that i 
appears on the computing path is at most k/n. Now the assertion of the lemma follows from 
the linearity of the expectation and the fact that there are n different variables. ■ 



3 Main Results 

3.1 Counter-example to Dinur-Priedgut Conjecture 

The proof of BKKKL inequality and Theorem [B] rely on a simple reduction that reduces the 
problem to the case that / is increasing. Similar but more complicated arguments of this sort 
were previously used by Talagrand in the context of isoperimetric inequalities for Banach-Space 
valued random variable [Tal89]. 

A key but simple observation in [B KK + 92] is that in the case of increasing functions, one 
can without increasing the influences by a factor of more than 2, reduce the problem further 
to the case where the domain of the function is {0, l} m endowed with the uniform probability 
measure, where m = 0(n log n). 

Although Theorem [B] provides a continuous version of the KKL inequality, a continuous 
version of Theorem [A] remained unknown. In fact Dinur and Friedgut (see [Fri04] ) considered 
the case where / is increasing, and observed that in this case the reduction of [B KK+92] to 



{0, l} m shows that / essentially depends on e CI f' e number of variables, or formally for every 
e > there exists a function g : [0, l] n — * {0, 1} which depends on e GIf / € number of variables 
and ||/ — 5 1| I < e. This provides a continuous version of Theorem [Aj but under the extra 
assumption of monotonicity. They conjectured (Conjecture 2.13 in |Fri04j ) that this extra 
condition is unnecessary. The reason that the reduction to increasing functions cannot be used 
here is that with this reduction one may get a function which depends on a small number of 
variables while the original function cannot be approximated by any such function. 
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We disprove Dinur-Friedgut conjecture in the following lemma by introducing a function 
for which the total influence is small, but it cannot be approximated by any function which 
depends on a small number of variables: 

Lemma 3.1 Let k £ N and r > 2 be an even integer, and let \i be the uniform "probability 
measure on X = {0, . . . , r — 1}. For n> r k , there exists a function f : X n — > {0, 1} such that 
If < k and \\f — <?||| > \ for every g : X n — > {0, 1} that depends on at most r k ~ 1 /2 variables. 

Proof. Consider a complete decision tree T of depth k where i v ^ i u for every two internal 
nodes u and v. Consider a leaf w which is the immediate child of a node u, and is corresponded 
to a € {0, . . . ,r - 1}. We let val(w) = L^J • 

By Lemma 12.31 we have If T < k. On the other hand consider a function g : X n — > {0, 1} 
that depends on at most r k ~ 1 /2 variables. Let L be the set of the leaf-parents of T. Note that 
\L\ = r k ~ 1 which shows that for at least half of the nodes v £ L, g is a constant on the children 
of v. This completes the proof because /t / g on half of the children of every such v. ■ 

In the first look Lemma 13.11 may sound disappointing because it shows that the continuous 
version of Theorem [A] is not true. But further inspection shows that indeed a continuous 
version does exist, but it is slightly different from the discrete case. Our counter-example to 
Dinur-Friedgut conjecture has a decision tree of constant depth. We will show that for a fixed 
e, having a constant-depth decision tree is the right reformulation of the conjecture. First we 
"discretize" the problem: 

Remark 3.2 Let / : [0, l] n — * {0, 1} be a function that satisfies the measurability conditions 
of Remark 12.21 Fix any < 5 < jq. Consider m E N and subdivide [0, l] n into m n equal 
size disjoint cells by subdividing each one of the base intervals into m intervals. For every 
x £ [0, l] n denote by C x the cell that contains x. For every 1 < i < n, and cell C, let Si(C) 
denote the set of m cells that are obtained by changing the cell C in the ith coordinate. 
Consider a measurable set A C [0, l] n . We say that a cell C is (1 — 6, A)- determined, if either 
Pr(j4|C) > 1 — 5 or Pr(yl c |C) > 1 — 5. Since A is measurable, for sufficiently large m, more 
than (1 — 5) fraction of the cells will be (1 — 5, ^-determined. For every 1 < i < n, let 

Ai := {x : f is a constant on Si(x)}. 

Since Ai are measurable, one can take m to be sufficiently large so that (1 — 5) fraction of the 
cells are simultaneously (1 — 5, /)-determined and (1 — 5, j4j)-determined, for every 1 < i < n. 
Now we construct a "discrete" approximation h of /, in the following way: 

(i) For every cell C, h(C) = {1}, if E(/|C) > 1 — 5. Furthermore for every such C and every 
1 < i < n, if Pv(Ai\C) >l-5, then h(C) = {1} for every C G Si{C). 

(ii) Similarly for every cell C, h(C) = {0}, if E(l — f\C) > 1 — 5. Furthermore for every such 
C and every 1 < i < n, if Pr^C) > 1 - 5, then h{C) = {0} for every C 6 Si(C). 

(iii) The value of h on the rest of the cells is defined arbitrarily. 

First we need to verify that h is well-defined. Note that for every C in (i) we have E(/|C") > 
1 - 25 > 1/2, while for every C in (ii), E(l - f\C) > 1 - 25 > 1/2. This shows that h is 
well-defined. 
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Note that h is defined so that if C x is (1 — 5, /)-determined and Pi(Ai\C x ) > 1 — 5, then h 
is constant on Si(C x ). Thus 

Ihij) — Pr({x : C x is not (1 — 5, /)-determined}) + Pr({x : C x is not (1 — 5, ^4j)-determined}) 

+Pv({x : ?r{A1\C x ) > 1-5}) < 5 + 5 + = 25 + ^% (7) 

l — o l — o 

Moreover 

Pr[/ ^ h] < Pr({x : C x is not (1 — 5, /)-determined}) + 5Pr({x : C x is (1 — 5, /)-determined}) 
< 25. (8) 

Inequalities ([7]) and ([8]) show that by taking 5 to be arbitrarily small, we can obtain a "discrete" 
approximation h of /, which is arbitrarily close to / in £2 norm, and its total influence is not 
much larger than If. ■ 

Theorem 3.3 Let f : [0, l] n — ► {0, 1} satisfy It < B. Then for every e > there exists a 
function g : [0, l] n — > {0, 1} such that \\f — </||| < e and <? /10s a decision tree of depth at most 

e CB/e\ 

Proof. By Remark 13.21 we can assume that the underlying probability space is the uniform 
probability measure on {0, . . . , r — 1}™ for a sufficiently large r that depends on / and e. 

By a raw decision tree we mean the mathematical object that is obtained by removing the 
values val(v) from the leaves of a decision tree. Consider a function / : {0, . . . , r — l} n — > {0, 1} 
with If < B. For every node u in a raw decision tree, / induces a function /„ of the variables 
that are not assigned a value through the path from the root to v. This function is obtained 
by fixing the values of the variables on the path from the root to v. Denote by m(v) the 
variable with maximum influence in f v , and note that if v is in distance k from the root then 
n(v) = r~ k is the fraction of x S {0, . . . , r — l} ra that v belongs to their computing paths (i.e. 
x agrees with the path from the root to v). 

Let C > 10 be the constant from the exponent in ([6]). Start with a single node as a raw 
decision tree. Consider the following procedure: For every leaf v, we let i v := m{v) and we 
branch on this variable, adding r children to v. Each application of this procedure increases 
the depth of the tree by one. We repeatedly apply the procedure until we obtain a raw decision 
tree T in which the fraction of the leaf-parents v that satisfy If v (iv) > e~ 18C B / e is at most 
e/3. Let S denote the set of the nodes v that satisfy If v {i v ) > e~ 18C " ' B l fl 7 and let L\ C S be 
the set of the leaf-parents in S. We claim that the depth of T is at most e^ 18C +4 )- B / e . 

Suppose to the contrary that this is not true. Consider a node u and note that at node 
u, the i u 'th variable contributes fj,(u) x If u (i u ) to If(i u ). Now consider another node v G T 
such that i v = i u . Since none of u and v is the ancestor of the other, the contribution of the 
i n 'th variable to If(i u ) at node u is disjoint from its contribution at node v. Denoting by h the 
depth of T and by the set of the vertices in distance k from its root, the above discussion 

1 We can assume that C' > 1 as ((6]) remains valid if one increases the constant in the exponent. 
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shows that 



h-l 

!f = ^2^(v)I fv (i v ) = E E PWfvM ^ 

v£T k=0veD k 

> E E Kv)iM>J2me- 18C ' B ^. 

k=0v£D k nS k=0 

Thus h < (3B/e)e 18C ' B ^ 2 + 1 < e ( 18C '+^ B ^. 

It is easy to see that for a similar reason, ^2 V jji{v)If v < If where now the sum is only over 
all leaf-parents of T. This shows that the fraction of the leaf-parents v that satisfy I* v > 3B/e 
is at most e/3. Let L2 denote the set of the leaf-parents that satisfy If v > 3B/e. 

Consider a leaf-parent v L\ U L 2 , i.e. If v (m(f v )) < e ~ lsc ' B / e2 and If v < W/e. By 
Theorem |Bl 

(E[/ v ])(l-E[/„])<e/6, 
which implies that min(E[/,;], 1 — E[/„]) < e/3. Now to every leaf w with parent u we assign 

val(w) - { ° E[fu] ~ 1/2 
val(^) - I x E[/J > 1/2 . 

Let g be the function that is computed by this decision tree. Denote by L the set of all 
leaf-parents and correspond to every x the unique leaf-parent l{x) G L on its computing path. 

\\f-g\\l = Pr[/(x) ^ g{x)\ < Pr[Z(x) G Li] + Pr[/(x) G L 2 ] + 

E min(E[/ tI ], 1 - E[/ v ]) <| + | + |< e - 

ve-L\(IaU£ 2 ) 



3.2 Variational Influences for Boolean functions 

Our next goal is to study the relation between the influences and the variational influences. 
The next theorem shows that when all influences are small then the total variational influence 
is large. One can prove this theorem easily using Theorem 1.5 in [Tal94], but for the sake of 
the completeness we give a direct proof here. 



Theorem 3.4 For f : X n — » M, we have 



v-m^Ei^y. (•) 



Proof. Let ti(x) = f(x) — E Si ( x )[/], and note that If(i) > E[lr t ^o]]; an d If[i>) = ll^lli- Let 
/ = Ylsc\n] Fs be the Fourier- Walsh expansion of /. Note that 



E St{x) [F s } = J F 5 d M (x i ) = { q S \* S S 
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Thus the Fourier- Walsh expansion of Li is the following. 



F s - E St{x) 

SC\n) 



E p s 

SC[n] 



E F * - E F * = E 

SC[n] S:i£S S:i£S 



For 1 < p < 2, by Holder's inequality we have 

!/?//• \ 1/p / r \ 0--pM/p 



u i\\p 



<%) = / ^Wo] < / Wo] < IMIaiX*) 5 



i j_ 

2 



On the other hand by (a generalization of) Theorem ICl we have 

v 1/2 / v 1/2 

E rail) = E 6)s) ^ (p-i)- fe/2 ikiii P - 

v 5:iG5,|S|=fc / \|S|=fc / 

Thus for p = 3/2 we get 

E ^r = Ei E ii^iiBEy//(W) 1/3 <^W/(. 

S:ieS,\S\<k 1 1 *=1 5:ig5,|S|=t t=l 

On the other hand since J2s-ieS II III = we 

Mil < //(t) 



1/3 



and hence 



\s\ - k + V 



E s ^^^i( 2fc+2 ^(W) 1/3 +^ 



Letting = log W we g e ^ 

Summing ([Toll over all i 6 [n], we get (J9]). ■ 

Let us next investigate the properties of the functions that satisfy If < B for some constant 
B. As it is already noticed by Dinur and Friedgut (see [Fri04]) a hyper contr activity argument 
similar to [Fri98j implies that every function / : {0,1,..., r — 1}™ — ► {0,1} with If < B 
essentially depends on r CB l e number of variables. In fact such an argument implies more: 

Lemma 3.5 Let f : {0, 1, ... , r — l} n — ► {0, 1} satisfy If < B. Then for every e > 0, there 
exists a function g : {0, 1, ... ,r — l} n — > {0, 1} which depends on r CB l e number of variables, 
and H/-5II2 ^ e - 

Lemma 13.11 shows that Lemma 13.51 is sharp up to the constant in the exponent. In the 
more general setting of functions / : [0, l] n — * {0, 1}, bounding the total variational influence 
implies less structure on the function. The following example shows that Theorem 13.31 does 
not need to hold when one replaces the bound If < B with If < B. 
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Example 3.6 Let / : [0, If -» {0,1} be defined as /(x) = if and only if x < (l-±, . . . , 

It is easy to see that E[/] ~ 1/e and < 2. Note that / cannot be approximated by a function 

with a decision tree of depth o(n). ■ 

The following theorem which is implied by Bourgain's proof for Proposition 1 in the ap- 
pendix of |Fri99] shows that bounded variational influence implies some weak structure on the 
function. 

Theorem 3.7 Let f : [0, l] n — > {0, 1} satisfy It < B. Then there exists a set J C [n] of size 
at most 10B and an assignment of values to the variables in J such that conditioning on this 
partial assignment, the expected value of f changes by at least 3" 



-500B 2 



To see that why Bourgain's proof implies Theorem 13.71 note that according to Equa- 
tion (2.18) in the appendix of [Fri99| if If < B, then 



E 



> 3- FmB \ (11) 



This immediately implies Theorem 13.71 Inspired by Conjecture 1.5 in [Fri99j, we conjecture 
that Theorem 13.71 can be improved to the following: 

Conjecture 3.8 There exists a function k : R 2 — > R such that the following holds. Let B, e > 
be two constants. For every f : [0, l] n — * {0, 1} that satisfies If < B, there exists a set J C [n] 
of size k(B, e) and an assignment of values to the variables in J such that conditioning on this 
partial assignment, the expected value of f is either less than e or greater than 1 — e. 

3.3 Subcubes of increasing sets 

The following theorem proved in [KKL88J is a straightforward corollary of the KKL inequality. 

Theorem D. There exists a function k : R 2 —* R + such that the following holds. For every 
increasing function f : {0, l} n — ► {0, 1} with Var[/] = p and every e > 0, there exists a set of 
coordinates J C [n] such that \ J\ < k(p, e)nj log n and E[/|J = 1] > 1 — e. 

Although it is wrong fully claimed in [BKK+92] that one can use Theorem |B] to prove 
that the same statement holds for functions / : [0, l] n — > {0, 1}, as it is noticed by Friedgut 
m [Fri04j . the funct ion in Example 13.61 shows that this is not true. 

Friedgut [Fri04] suggested the following conjecture as the continuous version of Theorem [Pi 



Conjecture 3.9 There exists a function k : M 2 — > R such that for every e > 0, lim^^oo ^ = 
0, and the following holds. Let f : [0, 1]™ — ► {0, 1} be an increasing function. There exists a 
set of coordinates J C [n] such that |J| < k(e,n) and either E[/(x)|J = 1] > 1 — e or 
E[f(x)\J = 0]<e. 

The next proposition shows that Conjecture 13.81 implies Conjecture 13.91 

Proposition 3.10 If Conjecture \3.8\ is true, then there exists a function k : R 2 — > R such 
that for every e > 0, linin^oo ^!£M = o, and the following holds. Let f : [0, l] n — ► {0, 1} be an 
increasing function. Then there exists a set of coordinates J C [n] such that \J\ < k(e,n) and 
either E[/(x)| J = 1] > 1 - e or E[f(x)\J = 0] < e. 
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Proof. For a set J C [n], let f\j=\ : [0, l] n — > {0, 1} be defined as /| j=i : x i— ► /(y) where 

_ J 1 i G J 

Fix some -B > 0, and let Jo := 0. For i > 0, inductively obtain Jj+i from Jj as in the 
following. We will use the notation fi = f\j i= \. 

If If. > B, then there exists a variable j £ [n] such that Ifjj) > -B/n. Let Jj+i = J« U {j}. 
Note that for every i/ £ i, := {i : E s .^[fi\ ^ 0}, we have f i+ i(y) = 1. Thus E[/ m ] > 
Efl^eA,]] an d furthermore we have E[/j] = E[E S ( x ) [/j]] . Hence 

E[/i+i] - E[/i] > E[l[ xgA .]] - E[E Sj .( x )[/i]] = Efl^g^.j - E^.^f/j]] 

> E[(E Sj( ,)[/ 4 ])(l - E s . ix) [fi})] = B[V a r Sj(x) [f t ]] = I fi (j) > B/n. 

Continuing in this manner we find a set for some k < n/B such that \J^\ < n/B and 
either E[f\Jf. = 1] > 1 — e or If k < B. But if If k < B, then assuming Conjecture 13.81 we can 
find a set J' of size K(B,e) such that E[f k \J' = 0] < e or E[/ fe |J' = 1] > 1 - e. Note that 
J = J k U J' is of size at most n/B + K,(B,e) and E[/^|J = 0] < e or E[/^|J = 1] > 1 — e. Now 
taking such that k(B, e) ~ n/B completes the proof. ■ 
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